Docket No.: COGNITA-OOIXX 

SELF-ORGANIZING NEURAL MAPPER 

CROSS-REFERENCE TO RELATED APPLICATIONS 
[0001] This application claims benefit of U.S. Provisional 
Patent Application No. 60/396,109. 

TECHNICAL FIELD 

[0002] This invention relates to knowledge management and more 
particularly, to artificial learning as applied to the creation, 
representation, and subsequent retrieval of information within a 
singular or distributed knowledge database. 

BACKGROUND INFORMATION 
[0003] The purpose of this invention, and knowledge management 
in general, is to help provide relevant solutions to questions 
and problems that have been solved before. An encyclopedia is, 
for example, a primitive knowledge management system: it 
provides a simple way for people to find information. In the 
case of the encyclopedia, the information is prepared, 
categorized, and cross-indexed to help people find the 
information more effectively. 
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[0004] While the inf ormatiort in-' an encyclopedia is static, 
computers allow us to collect and store large amounts of 
rapidly-changing information. So much information, in fact, 
that the ability to locate relevant answers becomes a critical, 
but challenging problem. Companies and organizations face a 
similar challenge in trying to provide employees, business 
partners, and customers with information about products and 
processes. Web sites and email are often used to communicate 
dynamic information, but can't provide a single point of access 
and have only increased the information-access challenge. 
[0005] Prior and existing attempts to solve this problem have 
resulted in two basic approaches to knowledge management: the 
user-burdened approach and the provider-jburdened approach. 
[0006] Applications implementing the user-burdened approach 
depend on highly-automated systems to gather, categorize, and 
present information in a format that puts the burden of the work 
on the person doing the searching. The user generally enters 
keywords using a specific syntax that often requires quotes or 
boolean symbols, such as ''AND" and "OR," and is presented with 
a list of possible results. The user must then modify their 
search to narrow the list, requiring that the user understand 
how to best present and improve their search criteria. This type 
of knowledge system will always return the same results for a 
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search unless something is added or removed from the knowledge 
database. Most web search engines are ''user-burdened" systems. 
[0007] The provider-burdened approach makes use of a human 
content management team, possibly assisted by some automated 
categorization technology, to manually organize the information 
to make it easier to find. This categorization process is time- 
consuming and resource-intensive, resulting in a system that is 
easier for users to search than user-burdened systems, but much 
more expensive to maintain.. 

[0008] A common problem with provider-burdened systems, is that 
it is difficult and expensive to keep knowledge up-to-date. 
Technologies in this category range from case-based reasoning 
(CBR) tools (reference to Inference patent) to most cognitive 
processing tools (see patent #5,797,135 Whalen et al.). This 
type of knowledge system is most frequently implemented by large 
companies and corporations, that can afford the cost and 
manpower required to create and maintain the content. 
[0009] Neither of these approaches is satisfactory on its own. 
One is difficult to use and the other is costly to maintain. 
There are currently few, if any, knowledge solutions that are 
both easy to search and cost-effective to maintain. Most 
existing technologies focus on identifying implied meaning by 
organizing the content or applying decision tree or other 
lexical technologies to the questions submitted. They try to 
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match a search to an answer b^sed upon the terms or the meanings 
found in the answer itself. 

SUMMARY 

[0010] The impetus to the Self -Organising Neural Mapper (SONM) 
technology according to the present invention is a result of 
study and use of many of the prior art technologies, and the 
less-then successful attempts of most organizations and 
companies to implement them. The concept of SONM itself is 
based upon the works of computer pioneer Alan Turing in machine 
intelligence and of linguist and scientist Marvin Minsky in 
human learning, as well as upon the inventor's own studies in 
linguistics and human communication. The goal was to develop an 
engine that would remove the burden from both the user and 
provider by learning how to provide answers based upon how 
previous questions were asked. 

[0011] The benefits of the present invention are a system that 
is: Easy to create. Those with knowledge to share need only 
concern themselves with the actual content, rather than its 
structure and formatting. ■ Easy to maintain. No expensive 
knowledge engineers are required to prepare, categorize, and 
organize content or build fancy decision trees. Easy to search, 
users can ask questions in the way the makes the most sense to 
them, without resorting to quote marks or confusing boolean 
logic. Reusable. If one person has a question, it's likely 
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others will have the same question. The more an answer is used, 
the easier it becomes to find. Self -Improving. The present 
invention leverages user feedback and behavior to strengthen or 
weaken the association between • a specific answer and the 
original question. 

[0012] The net benefit of the present invention is a technology 
that is extremely inexpensive to implement and use, and that 
becomes more useful as people use it to create, access, and 
apply information. 



BRIEF DESCRIPTION OF THE DRAWINGS 
[0013] These and other features and advantages of the present 
invention will be better understood by reading the following 
detailed description, taken together with the drawings wherein: 
[0014] FIG. 1 is a block diagram of the present invention; 
[0015] FIG. 2 is an object class diagram that illustrates the 
objects and relationships necessary to implement a system and 
method according to the present invention; 

[0016] FIG. 3 shows the steps required to create a new answer in 
the database; 

[0017] FIG. 4 shows the steps required when searching for an 
answer; 
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[0018] FIG. 5 shows the steps * required to learn from the search; 
[0019] FIG. 6 shows how a new piece of content, or memory, is 
added to the knowledge database including neurons and strengths; 
[0020] FIG. 7 shows what a memory looks like in the database 
with other memories. 

[0021] FIG. 8 shows how a search, or query, finds and 
prioritizes answers; and 

[0022] FIG. 9. shows how memories are strengthened and weakened 
after the system receives feedback from the user. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 
[0023] Implementation of the present invention 10, Fig. 1, 
requires a minimum of a relational database engine 12 and a 
small program to implement the logic 14 for the Teaching Engine 
14, the Searching Engine 16 and the Learning Engine 18. The 
relational database engine, or RDBMS 12, can be one of any 
number of commercial or free offerings, or can be developed as 
part of the application logic 14 itself. The implementation 
logic 14 can be written using any appropriate programming 
language, can be implemented in hardware, or can be implemented 
using RDBMS structures such as stored procedures and triggers. 
[0024] The Teaching Engine 14 database schema consists of three 
related tables with the following specifications: 
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[0025] The Answer Table 20 contains summaries and answers; the 
actual knowledge in the database or a pointer to where it can be 
found, and includes the following fields: 
Field: Id: Unique identifier for each answer. 

Field: Summary: Brief description of . the answer or question 
being answered. 

Field: Detail: The full answer. This can be in any format, 
although it will typically be implemented in HTML, XML or SGML. 
[0026] The Symbol Table 22 contains the unique symbols used by 
the search engine to match a query with an answer and includes 
the following field: 

Field: Name (a text string or a link to an external multimedia 
object, such as an image or sound) . 

[0027] The Neuron Table 24 contains the neuron objects that link 
specific symbols with specific answers, and include the 
following fields: 

Field: Id: Link to the ID field in the Answer table. 
Field: Name: Link to the Symbol field in the Symbol table. 

Field: Strength: Weight of this neuron. 
[0028] The Searching Engine database schema 16 typically 
includes three related tables with the following 

specifications : 

[0029] The Query Table 26 contains a list of user queries. For 
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every new search, an entry is created in the Query Table, and 
remains until the query is resolved. The Query table includes 
the following field: 

Field: Id: Unique identifier for each attempted search. 

[0030] The Stimulus Table 28 contains the stimulus objects that 

will be compared against symbols to locate the most probable 

answers. The table includes the following fields: 

Field: Name (a text string or a link to an external multimedia 

object, such as an image or sound) . 

Field: Query_Id: Link to ID field in the Query table. 

[0031] The Decision Table 30 contains the list of possible 

answers for a given search and typically includes: 

Field: Query_Id: Link to ID field in the Query table. 

Field: Answer_Id: Link to the ID field in the Answer table. 

[0032] The Learning Engine database schema 18 requires no 

additional tables. It acts upon and utilizes several existing 

tables (specified in the Teaching and Searching Engines) 

including: 

The Stimulus Table 28 

Used to identify which neurons need to be positively or 
negatively reinforced. 
The Neuron Table 24 

Positively or negatively reinforced, depending upon feedback 
from the searcher. 
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The Query Table 26 

The given query is removed from the Query Table after the 
query has been resolved (feedback has been received or 
sufficient time has passed to assume that it won't be received.) 
The Decision Table 30 

The given query is removed from the Query Table after the 
query has been resolved (feedback has been received or 
sufficient time has passed to assume that it won't be received.) 
[0033] The Teaching Engine logic 14 consists of one or more 
steps or acts required to accept a new answer and create or link 
between the symbols, answers and neurons required by the 
Searching and Learning Engines. The acts are as follows: 
[0034] A. The answer (Summary and Detail) is supplied by a user 
or programmer interface (hereafter referred to as ''the agent'') 
32 and is added to the Answer Table 20, act 100, Figs. 2 and 3. 
A unique ID is generated, act 102, for the ID field in the 
Answer Table. 

[0035] The Summary (and optionally, the Detail) is parsed into 
symbols using the following rules: (1) All non-alphanumeric 
characters are converted to "space" characters (or some other 
non-alphanumeric character) . Depending on the locale, non- 
alphanumeric characters that are generally considered part of a 
word (e.g., in English, the apostrophe) and are generally not 
converted. (2) Each space-delimited character-grouping is 
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converted to upper case and i3 termed a ''symbol", act 103. 

[0036] Each ''symbol" generated above that does not exist in the 

Symbol Table is added to the Symbol Table, act 104. 

[0037] An entry is created in the Neuron Table that links the 

name of each existing or newly-added symbol in the symbol table 

to the ID of the newly-added answer, act 106. The neuron 

strength is set to a default value. 

[0038] The Searching Engine logic 16 consists of certain acts 
required to accept query, to create the required temporary 
search structures, to provide a list of possible answers, to 
display the specific answers when they are selected, and to 
solicit feedback about the usefulness of each selected answer, 
see Fig. 4. The acts are as follows: 

[0039] The search text is supplied by the agent and is assigned 
an ID and added to the Query Table, act 108. 

[0040] The search text is parsed into stimuli in exactly the 
same manner that answers are parsed into symbols, act 110, as 
described in act 104 above. Each stimulus is added to the 
Stimulus Table and linked to the original query using the 
Query_Id field in the Stimulus Table. 

[0041] Each Stimulus is compared against the the Symbol Table, 
act 112. If a matching symbol exists, all Answers linked to that 
symbol via a neuron are written into the Decision Table, thereby 
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linking those answers to the original query, act 114. 
[0042] Each answer written to the Decision Table (linked to the 
original query) is assigned a weight equal to the sum of the 
strengths of each neuron that matches one of the query's 
stimuli. The answer summaries (i.e., the Decision List) are 
sorted and presented to the agent in order of descending 
strength, act 116. 

[0043] When the agent selects an answer from the decision list, 
the full answer detail is displayed. The agent is then given the 
opportunity to provide feedback stating whether or not the 
displayed answer was relevant. This step is repeated for each 
answer the agent selects from the decision list, act 118. 
[0044] The Learning Engine logic 18 includes those acts required 
to positively and negatively reinforce neurons after an answer 
has been selected by an agent and feedback has been provided. 
Fig. 5. The acts include: 

[0045] If the agent specifies that a specific answer was useful 
to them, act 120: 

A. The strength of each neuron linking that answer to a symbol 
that matches a stimulus from the original query is increased 
(positively reinforced) by a predetermined value, act 122. The 
amount it is increased may be a constant default value (such as 
10) or it may be relative to the average neural strength in the 
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system (such as 1.2 multiplied by 'the average). 

B. The strength of each neuron linking that answer to a symbol 
that does not match any stimulus from the original query is 
decreased (negatively reinforced) 'by a* predetermined value, act 
124. The value used for negative reinforcement will generally be 
a small fraction of the value used for positive reinforcement. 
C- The strength of each neuron linking an unselected answer to 
a symbol that matches a stimulus from the original query is 
decreased (negatively reinforced) by a predetermined value, act 
126. 

[0046] If the user or programmer interface states that a 
specific answer was not useful to them, act 130, each neuron 
linked to the selected answer and also to a symbol that matches 
a stimulus from the original query has its strength decreased 
(negatively reinforced) by a predetermined value, act 132. 
[0047] A typical SONM system will have an optimal strengthening- 
to-weakening ratio that may be determined by observing the 
system in action. This ratio is optimal when both the average 
and maximum neural strength values stabilize and do not change 
significantly over time. 

[0048] Several additions can be used to extend the base 
functionality of the disclosed embodiment of the present 
invention. First, the values for positive and negative 
reinforcement can be determined statistically, based upon 
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reinforcement history and the current average values of the 
affected and unaffected neurons. Dynamic, statistically- 

generated values for positive and negative reinforcement will 
create a self-optimizing feedback ' loop, more effectively 
differentiating between useful and less useful neurons. 
[0049] Second, several additional types of neurons can be 
introduced. In the base embodiment, neurons are used to relate 
symbols and answers. Similar logic can be employed to relate 
symbols to other symbols, allowing the search to account for the 
proximity of symbols to each other, and to relate answers to 
other answers, identifying answers that are similar or related 
to the selected answer. 

Example of adding a new answer using the Teaching Engine: 

[0050] It is desired to add an answer to the database describing 
what to do if your greyhound is cold. The answer consists of 
two parts: the question being answered, or summary, and the 
actual answer to the question, or detail. Note: To simplify the 
example, we will use only the summary for the initial teaching, 
although it is often preferable to include the detail as well. 
Summary: ''My greyhound is cold.'' 
Detail: ''Put a coat on your dog.'' 

[0051] The summary and detail are added to the database and the 
resulting answer is assigned a unique identification. The 
summary is then broken into discrete symbols. 
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Answer ID: #1 

Symbols: Table 1 below 

Symbol 
. MY . 

GREYHOUND 
IS 
COLD 



[0052] Each symbol is then linked to the answer by a neuron. The 

neuron contains a reference to the answer, a reference to the 

symbol, and a number representing the strength of the 
relationship between them. (See 24, Figure 2) . 

Neurons: Table 2 



J^swer ID 


Symbol 


S'treng'th 


#1 


MY 


100 


#1 


GREYHOUND 


100 


#1 


IS 


100 


#1 


COLD 


100 



Summarizing: 

[0053] Each symbol must be unique (I.e., there may be only one 
instance of the symbol ''GREYHOUND" in the system. Any number of 
neurons can refer to a specific symbol. 
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[0054] Each answer may or may not be unique. Any number of 
neurons can refer to a specific answer. See Fig. 7. 

[0055] Each neuron must be unique; a specific answer may be 
linked to a specific symbol by one and only one neuron. See 
Fig. 7. 

Example of searching for an answer using the Searching Engine: 

[0056] A user would like to find out what to do if his or her 
dog is cold. He enters search using one of the system 
interfaces : 

Query: "My dog is cold.'' 

[0057] The guery is parsed into stimuli in exactly the same 
manner in which an answer is broken into symbols, except that 
stimuli are linked directly to the query; no neurons are 
involved. See Fig. 8. 

Stimuli: Table 3 

Stimuli 

_ 

DOG 
IS 

COLD 



[0058] Each stimulus is checked against the symbols list. If 
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there is a symbol that matches the stimulus, every answer 
linked to that symbol is added to a decision list as a possible 
solution. In this example, the stimuli MY, IS, and COLD match 
symbols linked to the above answer, so that answer is added to 
the decision list. See Fig. 8. 

[0059] When all the stimuli have been checked, each answer in 
the decision list is assigned a weight equal to the sum of the 
strengths of all the neurons that link that answer with a symbol 
that matches one of the stimuli. If the neurons linking MY, IS, 
and COLD with thought #1 each have a strength of 100 (the actual 
assignment and adjustment of neuron strengths is discussed 
herein) , the overall weight of that particular decision in the 
list is 300. The user is then presented with a list of answer 
summaries, sorted in descending order of weight. 

Example of learning from an answer using the Learning Engine: 

[0060] If a specific answer is selected from the decision list 
and validated as having been useful (see Fig. 9) : 

A. All neurons linking the selected answer to symbols 
that match stimuli are strengthened by increasing their 
strength value. The amount they are strengthened may be a 
constant default value (such as 10) or may be relative to 
the average neural strength in the system (such as 1.2 
multiplied by the average) . 
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B. All stimuli that do not already exist as symbols, are 
added as symbols and linked, via new neurons, to the 
selected answer. Each newly created neuron is assigned the 
default strength. 

C. All neurons linking the selected answer to symbols 
that do not match stimuli are weakened by decreasing their 
strength value. The amount they are weakened is generally 
a small fraction of the strengthening value described in 
item 1, above. 

D. All neurons linking the unselected answers to symbols 
that match stimuli are weakened. 

[0061] A typical system consructed in accordance with the 
teachings of the present invention will have an optimal 
strengthening-to-weakening ratio that may be determined by 
observing the system in action. This ratio is optimal when both 
the average and maximum neural strength values stabilize and do 
not change significantly over time. 

[0062] As a result of these changes, the system of the present 
invention learns to associate new search terms (symbols) with 
answers based upon stimuli present in the questions asked. In 
the previous example, the stimulus ''dog'' has now been added as a 
symbol and linked to this answer. Future searches that include 
^'dog'' as a stimulus will result in this answer being presented 
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in the decision list. Further, stimuli that are frequently 
helpful in a particular search become more likely to impact the 
decisions listed in the future, while stimuli that are less 
helpful become less likely to impact the decisions listed. 
[0063] One benefit of the present invention is a technology that 
is extremely inexpensive to implement and use, and that becomes 
more useful as agents use it to create, access, and apply 
information. 

[0064] In general, the present invention has many advantages 
over existing solutions: 

A. It is simple to build a system based upon this technology. 
The underlying structure and logic are fundamentally simple and 
easy to implement. 

B. It is simple to add new knowledge to the database, ensuring 
that information can be easily collected when it is most 
relevant. No special formatting or organization of the 
knowledge is required, meaning little or no special training is 
required in order to contribute knowledge. 

C. It is simple to locate and change knowledge in the 
database, ensuring that information can be kept up-to-date. A 
particular knowledge item can be located by unique ID, or it 
can be located using the search portion of the invention. 
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D. It is simple to remove dated or obsolete knowledge, 
ensuring that obsolete information does not become confused 
with current information. Dated or obsolete knowledge can be 
located by unique ID, or it can be located using the search 
engine portion of the invention. In addition, little-used 
knowledge will inherently have very weak neurons (see Figure 5) 
and can be easily identified using basic database reporting 
techniques . 

E. It is simple to search for knowledge without the need for a 
quoted or Boolean syntax. The invention optimizes the search 
automatically, and uses the results of previous searches to 
learn how agents are likely to word future searches. 

F. Search effectiveness can be optimized for specific 
applications by adjusting the algorithms used for strengthening 
and weakening memories. 

G. Teaching engine does not require a specific interfaces- 
knowledge can be added by people in response to what they know, 
or by automated systems in response to events in their 
environment . 

H. Searching/Learning engine does not require a specific 
interface; knowledge can be searched for by people in response 
to a question or problem, or by automated systems in response 
to events in their environment. 
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[0065] Modifications and substitutions by one of ordinary skill 
in the art are considered to be within the scope of the present 
invention, which is not to be limited except by the following 
claims. 
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